CS261: A Second Course in Algorithms Lecture #12: Applications of Multiplicative Weights to Games and Linear Programs∗

نویسنده

  • Tim Roughgarden
چکیده

1 Extensions of the Multiplicative Weights Guarantee Last lecture we introduced the multiplicative weights algorithm for online decision-making. You don't need to remember the algorithm details for this lecture, but you should remember that it's a simple and natural algorithm (just one simple update per action per time step). You should also remember its regret guarantee, which we proved last lecture and will use today several times as a black box. 1 Theorem 1.1 The expected regret of the multiplicative weights algorithm is always at most 2 √ T ln n, where n is the number of actions and T is the time horizon. Recall the definition of regret, where A denotes the action set: max a∈A T t=1 r t (a) best fixed action − T t=1 r t (a t) our algorithm. The expectation in Theorem 1.1 is over the random choice of action in each time step; the reward vectors r 1 ,. .. , r T are arbitrary. The regret guarantee in Theorem 1.1 applies not only with with respect to the best fixed action in hindsight, but more generally to the best fixed probability distribution in * c 2016, Tim Roughgarden. 1 This lecture is a detour from our current study of online algorithms. While the multiplicative weights algorithm works online, the applications we discuss today are not online problems.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

CS261: A Second Course in Algorithms Lecture #11: Online Learning and the Multiplicative Weights Algorithm∗

For example, in job scheduling problems, one often thinks of the jobs as arriving online (i.e., one-by-one), with a new job needing to be scheduled on some machine immediately. Or in a graph problem, perhaps the vertices of a graph show up one by one (with whatever edges are incident to previously arriving vertices). Thus the meaning of “one piece at a time” varies with the problem, but it many...

متن کامل

Multiplicative Weights

In this lecture, we will study various applications of the theory of Multiplicative Weights (MW). In this section, we briefly review the general version of the MW algorithm that we studied in the previous lecture. The following sections then show how the theory can be applied to approximately solve zero-sum games and linear programs, and how it connects with the theory of boosting and approxima...

متن کامل

CS261: A Second Course in Algorithms Lecture #10: The Minimax Theorem and Algorithms for Linear Programming∗

Recall rock-paper-scissors (or roshambo). Two players simultaneously choose one of rock, paper, or scissors, with rock beating scissors, scissors beating paper, and paper beating rock. Here’s an idea: what if I made you go first? That’s obviously unfair — whatever you do, I can respond with the winning move. But what if I only forced you to commit to a probability distribution over rock, paper,...

متن کامل

CS261: A Second Course in Algorithms Lecture #20: The Maximum Cut Problem and Semidefinite Programming∗

Now that you’re finishing CS261, you’re well equipped to comprehend a lot of advanced material on algorithms. This lecture illustrates this point by teaching you about a cool and famous approximation algorithm. In the maximum cut problem, the input is an undirected graph G = (V,E) with a nonnegative weight we ≥ 0 for each edge e ∈ E. The goal is to compute a cut — a partition of the vertex set ...

متن کامل

CS261: A Second Course in Algorithms Lecture #6: Generalizations of Maximum Flow and Bipartite Matching∗

1 Fundamental Problems in Combinatorial Optimization Figure 1: Web of six fundamental problems in combinatorial optimization. The ones covered thus far are in red. Each arrow points from a problem to a generalization of that problem. We started the course by studying the maximum flow problem and the closely related s-t cut problem. We observed (Lecture #4) that the maximum-cardinality bipartite...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016